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FILTER PROPERTIES OF 
LEAST SQUARES FITTED POLYNOMIALS 


Bodo Kruger 


SUMMARY 


The use of least squares fitted polynomials (LSP) as high and low pass 
filters is analyzed in this paper. 

A typical high pass filter application is noise analysis of tracking data. 
Equations are given for the estimation of degrees of freedom k needed in the 
LSP in order to "take out" the orbit so that the residuals (tracking data - LSP) 
represent the tracking data noise. As the LSP only contains k terms, the orbit 
is not completely "taken out" and what is left over of the orbit in the tracking 
data is called the modeling error 8 . It is shown that the standard deviation of 8 
is proportional to T k and Y < k > 


cr s ~ T k Y< k > 


where 

T = the length of the data stretch 
Y( k > = k th derivative of the "orbit". 

The modeling error is analyzed in some detail for a lunar orbit (see Figures 1 
through 5) and it is shown that k = 7 is needed for both range and range rate 
data if the standard deviations o- f and cr.. of the modeling errors have to satisfy 

a f < 10 m 


cr.. < Icm/sec 

and if the data stretch is 1/4 revolution, starting at the earth-moon line. 


A typical application of low pass filtering is data compression. Noisy data 
is substituted by one or more points of an LSP which has been fitted to the noisy 
data. The modeling error 8 in this application introduces a bias in the com- 
pressed data points. It is shown that the commonly used midpoint of the LSP is 
not optimal with respect to the bias. For instance, for k ~ 5, the bias may be 
reduced by a factor of 1.53 if the optimal point is chosen instead of the midpoint. 
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FILTER PROPERTIES OF 


LEAST SQUARES FITTED POLYNOMIALS 


1.0 INTRODUCTION 

In this report the filter properties of Least Squares Fitted Polynomials 
(LSP) are analyzed from the viewpoint of their use as high pass and low pass 
filters . 

In the application as a high pass filter, noise is removed from data con- 
taminated by noise by fitting a LSP to the contaminated data. The noise is con- 
tained in the residuals (contaminated data minus LSP) but some of the data 
"spills over" into the residuals. The data points may be thought of as being 
generated by a polynomial with infinitely many terms. Fitting a LSP with a 
finite number of terms to the data has to result in modeling errors. These 
modeling errors "spill over" into the residuals and it is therefore desirable to 
keep them small. The modeling errors can be made small by using a LSP with 
many degrees of freedom, i.e. many terms. On the other hand, the noise intro- 
duced by the computer increases with the degrees of freedom of the LSP. It is 
therefore desirable to determine the minimum number of degrees of freedom of 
the LSP with acceptable modeling errors. Expressions for the variance of the 
modeling errors are derived in Section 2 and their application to analysis of 
tracking data from a lunar orbit is demonstrated in Section 3. 

In the application of the LSP as a low pass filter , noisy data is replaced by 
the LSP. The modeling error shows up as a "bias" and the noise on the data 
introduces a statistical uncertainty to the LSP. Expressions for both the bias 
and the statistical uncertainty are given in Section 4, 


2.0 THE LEAST SQUARES FITTED POLYNOMIAL AS HIGH PASS FILTER 

2.1 Data Without Noise 

Assume that exact data points are generated by the polynomial 

Y. = b 0 + bjj + , . . b^j*' 1 + b k i k + . . . b^j 0 " 1 (2.1) 


and that a polynomial 



y, = a 0 + 3j i + ... + a k .!i k - 1 


( 2 . 2 ) 


with k degrees of freedom is fitted to Yj in the least squares sense. N equi- 
distant values of Y. are available and i and j stand for the integers from 1 to N. 
The sampling interval is normalized to X, which can be done without loss of 
generality. 

It is shown in Reference 1, Equation (10) that y L may be written 
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y\ can not be fitted exactly to Y l if k < n. 
The residuals or modeling error 





( 2 . 7 ) 


may be written 
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But for m < k we obtain from Reference 1, Appendix E, 
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It is shown in Appendix A that this expression also may be written 



( 2 . 11 ) 
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The variance E {8 3 } of the modeling error is 



In Appendix B it is shown that for large N 
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Insertion in Equation (2.13) yields 
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An alternate and useful expression for the variance is obtained by expressing 
the generating polynomial Y. , Equation (2.1), in a Taylors series 
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If the sampling interval is h , then 


(2.16) 


t = j h 

and thus by comparison with Equation (2.1) 
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For large N we may write the interval of observation T as 

T = N • h 
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The variance is then 
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In many cases only the first term needs to be considered and Equation (2.20) 
simplifies to 
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where 
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If Y (t) only contains even or odd terms k may be chosen so that Y (k) = 0 and 
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Sometimes it is desirable to consider a symmetric interval from -T to +T. 
The modeling error S. for the symmetric interval may be derived by the vari- 
able transformation 


x - t + T 


(2.23) 
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Thus if 


-T < t < +T 


then 


0 < x < 2T 


and the previously found equations for E {S 2 } are applicable to the x interval. 
Substituting Equation (2.23) into (2.16) we obtain 


or 


where 


Y = 
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Substitution Equation (2.25) into Equation (2.20) yields 
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where E*(S 2 ) deviates the variance of the modeling error S. for the interval 
from -T to +T. 

From Equations (2.21) and (2.26) it is seen that cr B is proportional to the 
length of the interval raised to power k and proportional to the k th derivative 
of Y if terms of higher order then k are negligible. 

2.2 Data With Noise 


Let y . be the observed values of the true data points Yj . If the data is con- 
taminated with noise e . then 


= Y. + 


(2.27) 


The residuals are 
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If the e. are uncorrelated stochastic variables with zero mean, then 
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because 8. is a polynomial in i according to Equation (2.11) 


N | 

Y 0 j 

Z_j i A 

j = i 1 


e j 


is then also a stochastic variable and 


. . No N , 

1 im J. y -1 y-» o j 

N - * 00 N Z_j |A| Z-j i A 

i=l j=l 1 


e s - 0 


In Reference 1, Appendix E, it is shown that 
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As it is immaterial whether the summation index is i or j , we thus have 
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Using Equation (2.20) the final result is obtained 
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The total variance is thus obtained by adding the noise variance to the model 
error variance. See also Reference 2. 


3.0 ANALYSIS OF RANGE AND RANGE RATE DATA 
FROM A LUNAR ORBIT 

In order to get a feel for the degree of polynomial needed for least squares 
fits to Lunar Orbiter data, the case of a circular lunar orbit is analyzed. 

From Figure 1 we obtain, neglecting the earth rotation 

r 2 = a 2 + b 2 -• 2ab cos cot (3.1) 


The application of Equations (2.20) through (2.26) requires a series expansion of 
the range r and the range rate r. The easiest way seems to be to expand cos cot 
into a series and then to extract the root. 

With 


a = 3.844 x 10 8 m 
b = 1.800 x 10 6 m 

ji - 4.903 x 10 “ 12 m 3 /sec 2 

co 2 = 8.406 x 10“ 7 (rad/sec) 2 
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Figure 1. Lunar Orbit Geometry 
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where r is in meters and t in seconds. The range rate r in m/sec is 
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The standard deviations <x r and ay of the modeling errors for range and 
range rate have been calculated for the lunar orbit using Equations (2.21), (2.22) 
and (2.26) both for the unsymmetric interval 0 to T and for the symmetric in- 
terval - T to +T. The results are summarized in Table 1 and are shown 
graphicallyin Figures 2 through 5. For the region of a T and ay shown in 
Figures 2 through 5, the one -term approximation leading to Equations (2.21), 
(2.22) and (2.26) is valid. For higher values of a r and a • the next term in the 
series expansions should be considered (see Section 2.1). 

In Figures 4 and 5 are shown two points obtained by computer simulation. 
Using the geometry shown in Figure 1, 75 values for r and r were calculated 
for the time interval -2220 sec < t < +2220 sec. A polynomial with k = 7 was 
fitted to the range data and a polynomial with k = 8 was fitted to the range rate 
data. The standard deviations of the residuals are in excellent agreement with 
the curves in Figures 4 and 5. 

The time for one orbit is r = 6,840 sec and for 1/4 of an orbit r /4 = 1,710 
sec. Assume we want to fit a polynomial to 1/4 of an orbit with the modeling 
errors 

a r < 10 m 
ay < 1 cm/sec 


Table 1 


The Standard Deviation of the Modeling Errors for Range 
and Range Rate for a Lunar Orbit 


k 

Unsymmetric Interval 
from 0 to T 

Symmetric Interval 
from -T to +T 

m 

°V 

m/sec 

cr 

r 

m 

cr. 

r 

m/sec 

2 

5.86 x 10" 2 T 2 

2.46 X 10" 8 T 3 


3.28 x 10“ 8 T 3 

3 

2.06 x 10~ 9 T 4 

4.10 X 10“ 9 T 3 

4.13 x 10" 9 T 4 

4 

2.58 X 10“ 10 T 4 

1.153 x 10" 16 T s 

3.69 x 10“ 16 T 5 

5 

5.78 x 10“ 18 T 6 

1.153 x 10~ 17 T 5 

3.08 x 10' 17 T 6 

6 

4.82 x 10" 18 T 6 

2.49 x 10" 2S T 7 

2.27 x 10' 24 T 7 

■ 

8.92 X 10“ 27 T 8 

1.775 X 10“ 26 T 7 

1.420 x 10" 25 T 8 

8 

5.56 x 10” 28 T 8 

4.20 x 10” 34 T 9 

1.195 x 10" 32 T 9 

9 

1.171 x 10“ 3S T 10 

2.35 x 10“ 35 T 9 

5.98 x 10“ 34 T 10 

10 



6.47 x lO” 41 ^ 1 

11 



2.99 x 10** 42 T 12 


For the unsymmetric case, 0 < t < 1710 sec we obtain from Figures 2 and 3 
that a polynomial with k « 7 is needed for both range and range rate. If the 
interval is symmetric, -855 < t < +855 sec, we see from Figures 4 and 5 that 
a polynomial with k = 7 is needed for the range and k = 6 for range rate. 
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NASA<GSFC~TAOS 

MISSION A TRAJECTORY ANALYSIS DIVISION 
■RANCH SSI DATE 1/17/AA 

■ Y >ODO KROGER PLOT NO, 1010 


Figure 2. The standard deviation cr for the modeling error from a least squares polynomial fit, 
with k degrees of freedom, to range data from a lunar orbit. The interval of observation starts at 
the earth-moon line and is of duration T sec. 
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NASA-GSKC-TBOS 

MISSION A TRAJECTORY ANALYSIS DIVISION 
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Figure 3. The standard deviation • a* r - for the modeling error from a least squares polynomial fit, 
with k degrees of freedom, to range rate data from a lunar orbit. The interval of observation ‘ 
starts at the earth-moon line and is of duration T sec. 
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k K7fk * 


POINT OBTAINED- 
BY COMPUTER 
SIMULATION 


(T IN SECONDS) 


NASA-GSPC-TSDS 

Mission • trajectory analysis division 

■NANCH SSI DATE 1/17/SS 

■Y NOOO KRUGER PLOT NO. 1032 


Figure 4. The standard deviation cr f for the modeling error from a least squares polynomial fit, 
with k degrees of freedom, to range data from a lunar orbit. The interval of observation is from 
-T to + T, symmetric around the earth-moon line. 
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4.0 THE LEAST SQUARES FITTED POLYNOMIAL AS LOW PASS FILTER 


4.1 Bias and Noise 


In this application of the least squares fitted polynomial die measured data 
points y. are substituted by the LSP y^ in order to eliminate measurement 
noise. The LSP is thus used as a low pass, filter. The problem in this applica- 
tion is to determine the difference 5. between the true data point and the 
LSP y. . 


s i = Y i - y* 


If the measured points y . are contaminated with noise g'j , then 
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The first term in this equation is "bias" due to the modeling error and the 
second term is the zero mean error due to measurement_noise. The bias term 
will be denoted 3. and the variance of the noise E {S i - o i } 2 . In Appendix C it 
is shown that for large N, 3. may be written 
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where P k are the Legendre polynomials in u and 
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If Y. is expanded in a Taylor series an alternate expression is obtained using 
Equation (2.18) and (2.19) 


P x - Y(k> + 2 (2k VI) { P k+ . + ( 2k + !) P J Y<k + 1) 

(4.4) 

+ 4 (2k + ' 1) 2 ~ ( 2k + ~ 3 ) { P x«+ (2k + 3)P ktl + (k + 2) (2k + 1) P k } Y<x +2 > + ... 



where 

T = the total length of the interval of observation 
y(k) = ^th derivation of Y 

T is in the same units as the variable Y is expanded in. 

If the e. are uncorrelated and have standard deviation cr € , then Equation (17) 
in Reference 1 is applicable and 

2 a e 

E { 5 i - S J = -jT { p 0 + 3 P? + + (2k + 1) P k 2 } (4.5) 
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where P k are the Legendre Polynomials in u as before, 

4.2 Data Compression 

A form of data compression is to substitute the N original data points by 
one or more_points from the LSP y., The compressed data points are subject 
to the bias 5 as given by Equations (4,2) or (4.4). Therefore, y. should be 
evaluated or Interrogated at points where S, is small. 

_From Equations (4.2) and (4.4) it is seen that the first term in the expansion 
of 3. is P k b k or P k Y (k) , The effect of this term can be eliminated by interro- 
gating y. at a point where P k = 0. For k 2 3 we have several roots to choose 
from. In order to determine which root to use, the next term in the expansion 
of S. is considered. This term is proportional to P ktl as can be seen from 
Equations (4.2) and (4.4). We thus choose the root for which j P k+1 | is smallest. 
The roots of P k and jP k + 1 j for k values from 3 to 5 are listed below. 


k = 3 


1 U | 

P 3 

1 P 4 1 

0 

0 

0.3750 

0.7746 

0 

0.3000 

li 



1 u | 

P 4 

i p 5 i 

0.3400 

0 

0.3294 

0.8611 

0 

0.2444 

k - 5 



\ u | 

p 5 

i p 6 i 

0 

0 

0.3125 

0.5385 

0 

0.2870 

0.9062 

0 

0.2049 


From the values of | P k+l | it is seen that the 'optimum root is the numerically 
largest root of P k = 0. 

A commonly used point of interrogation is the midpoint u = 0. It is clear 
from the above that this is a very poor choice for even k. For odd k some im- 
provement is obtained by choosing the optimum root rather than u = 0. For 
k = 3 the improvement is a factor of 1.25 and for k = 5 the improvement factor 
is 1.53. 
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because 


(A-l) 
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A 


V 


by definition. Thus 



* 

J 

A 


jk + v 4. |k + v = 


A 


k+v 


\ 


+ v 


. . A 


2k+ v~ 1 


* * ' 

\ 


: k-1 a A 

1 A k“l * * * ’ n 2k-: 


1 A 0 

i A t 


... (-l) k 
I A| 



;k-i a A 

1 k .~ 1 ’ * ’ n 2k“2 

; k+v A A 

1 "k+v • ft 2k+v-l 
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APPENDIX B 


EVALUATION OF 


N 


^1 i k+p l ' I i k+q l 

i = i 


y ii k +pi . ii k +<i| = y 

i= 1 i = l 


. . . A k _ 


i k 1 A k-1 . . , A 2k . 2 

i k+p A k+p . , . A 2k „ 1+p 


i k+q A, 


• \-i 


; k “l a 

1 A k-1 * 


k+ q 


• ^2k-2 


. A. 


2k + q 
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The determinants in the first column are all zero except the last one. Thus 


A 0 




N 

2~’|i k+p l • I i k+q l 

i 3 1 


For large N 


A| • 


k-l • ■ 

. A 2k _ 2 

A 2k-l+q 

‘k+p 1 ' 

• A 2k“l+p 

A 2k+p+q 


(B-2) 


A 


V 




N vn 

V + 1 


so that 


A A A 


1 1 

1 

A q ... A k+q 

• * • 


A k k + 1 + q 

* • * 

•r, 

• • • 
at jfc A 

- J^(l + k)2+p+q 

• # » 

• ♦ » 

1 1 1 

A k-1 A 2k-2 A 2k-l+q 


k •” 2k- 1 2k + q 

1 1 1 

A k+p * * ' A 2k-l+p A 2k+p+ 1 


k + 1 + p 2k + p 2k + 1 + p + q 



but 


1 i l 

2 * ' ’ k k + 1 + q 


1 i _j i 

2 3 ’ ’ ’ k + 1 k + 2 + q 


I i i 

k k + l 2k - i 2k + q 

II 11 

k + l+ pk + 2+ p 2k+p2k+l+p + q 


k + 1 + q 


k + 1 + q 
2 


k + 2 + q 
2 


k + 2 + q 
3 


(k + q) i 

(2k + q) ! (2k + l+p + q) 


2k + q 2k + q 

k k + l 


2k + 1 + p + q 2k + 1 + p + q 

k+l+p k+2+p 


k + 1 + q 
_ 

k + 2 + q 
k + l 


2k + q 
2k ~ 1 

2k + 1 + p + q 
2k + p 
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(k+p) (k + q) 
k + l+p 


(k+p) (k~l + q) 
2(k + 2 + p) 


(k+p) (1 + q) 
k (2k+p) 


(k“l + p)(k + q) (k~ 1 +p) (k- 1 + q) 
2 (k + l+p) 3 (k + 2+p) 


( k + q) ! 

(2k + q) ! (2k + l+p + q) 


(k- 1+p) (1 + g) 
(k+1) (2k+p) 




(1 + p) (1 + q) (1+p) (k“l + q) (1 + p) (1+q) 

k (k + l+p) (k + 1) (k + 2+p) (2k- 1) (2k+p) 


1 

.1 

2 


1 

2 

1 

3 


I 

k 

1 

k + 1 


[(k + p)!] 2 [(k + q)] 2 

p ! q! (2k + p) ! (2k + q) ! (2k + 1 + p + q) 


J. _J_ 1_ 

k k + 1 2k ** 1 


a 


(B-3) 
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so that for large N 


In the first step the rows are multiplied by factors so that "one 
in the last column and in the second step the bottom row is subtracted from the 
other rows. In Reference 1, Equation (C-7) it is shown that for large N 






EVALUATION OF 


* 



With v = 0 and the substitution 


N 


u + 1 


we obtain for large N 



1 


1 


1 

2 


1 

k 


u + 1 1 1 

2 2 3 


1 

k + 1 


<u + V\ 


1 

3 


1 

4 


1 

k + 2 


But 


'u + l\ k 1 1 

v 2 / k + 1 k + 2 



fu + \ \ 
\ 2 j 


k! 


2 k n (2 j - 


1) 


{P k + (2k - 1) P k _, + k(2k - 3) p k . 2 + ...} 


a k P k + a k „ 1 ?P k „ 1 +, a k _ 2 P k-1 


(C-l) 


where P k = P k (u) are the Legendre Polynomials. Thus 


— p + — p 

2 r i 2 


1 1 1 

— P -f._p -f — p 

6 r 2 2 i 3 0 


a k **k + a k-l ^k-1 + a k-2 ^k-2 + 


1 

2 


I 

3 


1 

2 

3 


I 

4 


_1 

k 


k + 1 


' k + 2 


k + 1 k + 2 * * * 2k 


Nk(k+1) 


(C-2) 


P 0 may be eliminated by subtracting the top row from the others , yielding 




1 


But for large N 


A| = | K [ N 2k 


so that 


k! 


2 k n ( 2 j - 1 ) 


P k N k 


(C-6) 


For v = 1 we obtain in the same manner 


k+ 1 


(k + 1) ! 


... k+l 

2 k+l n (2 j - i) 
1 


{P k+1 + (2k + 1) P k } N k+1 (C-7) 


and for v = 2 


k+2 


(k + 2) ! 


L k + 2 

2 k+2 n (2j -l) 


{P k+2 + (2k + 3) P k+1 + (k + 2) (2k + 1) P k } N 


k+2 


(C-8) 


